56 research outputs found

    Extension of Lander-Waterman theory for sequencing filtered DNA libraries

    Get PDF
    BACKGROUND: The degree to which conventional DNA sequencing techniques will be successful for highly repetitive genomes is unclear. Investigators are therefore considering various filtering methods to select against high-copy sequence in DNA clone libraries. The standard model for random sequencing, Lander-Waterman theory, does not account for two important issues in such libraries, discontinuities and position-based sampling biases (the so-called "edge effect"). We report an extension of the theory for analyzing such configurations. RESULTS: The edge effect cannot be neglected in most cases. Specifically, rates of coverage and gap reduction are appreciably lower than those for conventional libraries, as predicted by standard theory. Performance decreases as read length increases relative to island size. Although opposite of what happens in a conventional library, this apparent paradox is readily explained in terms of the edge effect. The model agrees well with prototype gene-tagging experiments for Zea mays and Sorghum bicolor. Moreover, the associated density function suggests well-defined probabilistic milestones for the number of reads necessary to capture a given fraction of the gene space. An exception for applying standard theory arises if sequence redundancy is less than about 1-fold. Here, evolution of the random quantities is independent of library gaps and edge effects. This observation effectively validates the practice of using standard theory to estimate the genic enrichment of a library based on light shotgun sequencing. CONCLUSION: Coverage performance using a filtered library is significantly lower than that for an equivalent-sized conventional library, suggesting that directed methods may be more critical for the former. The proposed model should be useful for analyzing future projects

    Novel and nodulation-regulated microRNAs in soybean roots

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Small RNAs regulate a number of developmental processes in plants and animals. However, the role of small RNAs in legume-rhizobial symbiosis is largely unexplored. Symbiosis between legumes (e.g. soybean) and rhizobia bacteria (e.g. <it>Bradyrhizobium japonicum</it>) results in root nodules where the majority of biological nitrogen fixation occurs. We sought to identify microRNAs (miRNAs) regulated during soybean-<it>B. japonicum </it>symbiosis.</p> <p>Results</p> <p>We sequenced ~350000 small RNAs from soybean roots inoculated with <it>B. japonicum </it>and identified conserved miRNAs based on similarity to miRNAs known in other plant species and new miRNAs based on potential hairpin-forming precursors within soybean EST and shotgun genomic sequences. These bioinformatics analyses identified 55 families of miRNAs of which 35 were novel. A subset of these miRNAs were validated by Northern analysis and miRNAs differentially responding to <it>B. japonicum </it>inoculation were identified. We also identified putative target genes of the identified miRNAs and verified <it>in vivo </it>cleavage of a subset of these targets by 5'-RACE analysis. Using conserved miRNAs as internal control, we estimated that our analysis identified ~50% of miRNAs in soybean roots.</p> <p>Conclusion</p> <p>Construction and analysis of a small RNA library led to the identification of 20 conserved and 35 novel miRNA families in soybean. The availability of complete and assembled genome sequence information will enable identification of many other miRNAs. The conserved miRNA loci and novel miRNAs identified in this study enable investigation of the role of miRNAs in rhizobial symbiosis.</p

    The C-Fern (Ceratopteris richardii) Genome: Insights Into Plant Genome Evolution With the First Partial Homosporous Fern Genome Assembly

    Get PDF
    Ferns are notorious for possessing large genomes and numerous chromosomes. Despite decades of speculation, the processes underlying the expansive genomes of ferns are unclear, largely due to the absence of a sequenced homosporous fern genome. The lack of this crucial resource has not only hindered investigations of evolutionary processes responsible for the unusual genome characteristics of homosporous ferns, but also impeded synthesis of genome evolution across land plants. Here, we used the model fern species Ceratopteris richardii to address the processes (e.g., polyploidy, spread of repeat elements) by which the large genomes and high chromosome numbers typical of homosporous ferns may have evolved and have been maintained. We directly compared repeat compositions in species spanning the green plant tree of life and a diversity of genome sizes, as well as both short- and long-read-based assemblies of Ceratopteris. We found evidence consistent with a single ancient polyploidy event in the evolutionary history of Ceratopteris based on both genomic and cytogenetic data, and on repeat proportions similar to those found in large flowering plant genomes. This study provides a major stepping-stone in the understanding of land plant evolutionary genomics by providing the first homosporous fern reference genome, as well as insights into the processes underlying the formation of these massive genomes

    Deep expression analysis reveals distinct cold-response strategies in rubber tree (hevea brasiliensis)

    Get PDF
    Natural rubber, an indispensable commodity used in approximately 40,000 products, is fundamental to the tire industry. The rubber tree species Hevea brasiliensis (Willd. ex Adr. de Juss.) Muell-Arg., which is native the Amazon rainforest, is the major producer of latex worldwide. Rubber tree breeding is time consuming, expensive and requires large field areas. Thus, genetic studies could optimize field evaluations, thereby reducing the time and area required for these experiments. In this work, transcriptome sequencing was used to identify a full set of transcripts and to evaluate the gene expression involved in the different cold-response strategies of the RRIM600 (cold-resistant) and GT1 (cold-tolerant) genotypes.ResultsWe built a comprehensive transcriptome using multiple database sources, which resulted in 104,738 transcripts clustered in 49,304 genes. The RNA-seq data from the leaf tissues sampled at four different times for each genotype were used to perform a gene-level expression analysis. Differentially expressed genes (DEGs) were identified through pairwise comparisons between the two genotypes for each time series of cold treatments.DEG annotation revealed that RRIM600 and GT1 exhibit different chilling tolerance strategies. To cope with cold stress, the RRIM600 clone upregulates genes promoting stomata closure, photosynthesis inhibition and a more efficient reactive oxygen species (ROS) scavenging system. The transcriptome was also searched for putative molecular markers (single nucleotide polymorphisms (SNPs) and microsatellites) in each genotype. and a total of 27,111 microsatellites and 202,949 (GT1) and 156,395 (RRIM600) SNPs were identified in GT1 and RRIM600. Furthermore, a search for alternative splicing (AS) events identified a total of 20,279 events.ConclusionsThe elucidation of genes involved in different chilling tolerance strategies associated with molecular markers and information regarding AS events provides a powerful tool for further genetic and genomic analyses of rubber tree breeding20CONSELHO NACIONAL DE DESENVOLVIMENTO CIENTÍFICO E TECNOLΓ“GICO - CNPQCOORDENAÇÃO DE APERFEIΓ‡OAMENTO DE PESSOAL DE NÍVEL SUPERIOR - CAPESFUNDAÇÃO DE AMPARO Γ€ PESQUISA DO ESTADO DE SΓƒO PAULO - FAPESP478701/2012–8; 402954/2012Sem informação2007/50392–1; 2012/50491–8; 2014/18755–0; 2015/24346–

    Validation of reference transcripts in strawberry (<i>Fragaria</i> spp.)

    Get PDF
    Contemporary methods to assay gene expression depend on a stable set of reference transcripts for accurate quantitation. A lack of well-tested reference genes slows progress in characterizing gene expression in high-value specialty crops. In this study, a set of strawberry (Fragaria spp.) constitutively expressed reference genes has been identified by merging digital gene expression data with expression profiling. Constitutive reference candidates were validated using quantitative PCR and hybridization. Several transcripts have been identified that show improved stability across tissues relative to traditional reference transcripts. Results are similar between commercial octoploid strawberry and the diploid model. Our findings also show that while some never-before-used references are appropriate for most applications, even the most stable reference transcripts require careful assessment across the diverse tissues and fruit developmental states before being adopted as controls.Facultad de Ciencias ExactasInstituto de FisiologΓ­a Vegeta

    Transcriptomic Shock Generates Evolutionary Novelty in a Newly Formed, Natural Allopolyploid Plant

    Get PDF
    SummaryNew hybrid species might be expected to show patterns of gene expression intermediate to those shown by parental species [1, 2]. β€œTranscriptomic shock” may also occur, in which gene expression is disrupted; this may be further modified by whole genome duplication (causing allopolyploidy) [3–16]. β€œShock” can include instantaneous partitioning of gene expression between parental copies of genes among tissues [16–19]. These effects have not previously been studied at a population level in a natural allopolyploid plant species. Here, we survey tissue-specific expression of 144 duplicated gene pairs derived from different parental species (homeologs) in two natural populations of 40-generation-old allotetraploid Tragopogon miscellus (Asteraceae) plants. We compare these results with patterns of allelic expression in both inΒ vitro β€œhybrids” and hand-crossed F1Β hybrids between the parental diploids T. dubius and T.Β pratensis, and with patterns of homeolog expression in synthetic (S1) allotetraploids. Partitioning of expression was frequent in natural allopolyploids, but F1 hybrids and S1 allopolyploids showed less partitioning of expression than the natural allopolyploids and the inΒ vitro β€œhybrids” of diploid parents. Our results suggest that regulation of gene expression is relaxed in a concerted manner upon hybridization, and new patterns of partitioned expression subsequently emerge over the generations following allopolyploidization

    A physical map for the Amborella trichopoda genome sheds light on the evolution of angiosperm genome structure

    Get PDF
    Background: Recent phylogenetic analyses have identified Amborella trichopoda, an understory tree species endemic to the forests of New Caledonia, as sister to a clade including all other known flowering plant species. The Amborella genome is a unique reference for understanding the evolution of angiosperm genomes because it can serve as an outgroup to root comparative analyses. A physical map, BAC end sequences and sample shotgun sequences provide a first view of the 870 Mbp Amborella genome.Results: Analysis of Amborella BAC ends sequenced from each contig suggests that the density of long terminal repeat retrotransposons is negatively correlated with that of protein coding genes. Syntenic, presumably ancestral, gene blocks were identified in comparisons of the Amborella BAC contigs and the sequenced Arabidopsis thaliana, Populus trichocarpa, Vitis vinifera and Oryza sativa genomes. Parsimony mapping of the loss of synteny corroborates previous analyses suggesting that the rate of structural change has been more rapid on lineages leading to Arabidopsis and Oryza compared with lineages leading to Populus and Vitis. The gamma paleohexiploidy event identified in the Arabidopsis, Populus and Vitis genomes is shown to have occurred after the divergence of all other known angiosperms from the lineage leading to Amborella.Conclusions: When placed in the context of a physical map, BAC end sequences representing just 5.4% of the Amborella genome have facilitated reconstruction of gene blocks that existed in the last common ancestor of all flowering plants. The Amborella genome is an invaluable reference for inferences concerning the ancestral angiosperm and subsequent genome evolution

    The Entomopathogenic Bacterial Endosymbionts Xenorhabdus and Photorhabdus: Convergent Lifestyles from Divergent Genomes

    Get PDF
    Members of the genus Xenorhabdus are entomopathogenic bacteria that associate with nematodes. The nematode-bacteria pair infects and kills insects, with both partners contributing to insect pathogenesis and the bacteria providing nutrition to the nematode from available insect-derived nutrients. The nematode provides the bacteria with protection from predators, access to nutrients, and a mechanism of dispersal. Members of the bacterial genus Photorhabdus also associate with nematodes to kill insects, and both genera of bacteria provide similar services to their different nematode hosts through unique physiological and metabolic mechanisms. We posited that these differences would be reflected in their respective genomes. To test this, we sequenced to completion the genomes of Xenorhabdus nematophila ATCC 19061 and Xenorhabdus bovienii SS-2004. As expected, both Xenorhabdus genomes encode many anti-insecticidal compounds, commensurate with their entomopathogenic lifestyle. Despite the similarities in lifestyle between Xenorhabdus and Photorhabdus bacteria, a comparative analysis of the Xenorhabdus, Photorhabdus luminescens, and P. asymbiotica genomes suggests genomic divergence. These findings indicate that evolutionary changes shaped by symbiotic interactions can follow different routes to achieve similar end points

    Maize Inbreds Exhibit High Levels of Copy Number Variation (CNV) and Presence/Absence Variation (PAV) in Genome Content

    Get PDF
    Following the domestication of maize over the past ∼10,000 years, breeders have exploited the extensive genetic diversity of this species to mold its phenotype to meet human needs. The extent of structural variation, including copy number variation (CNV) and presence/absence variation (PAV), which are thought to contribute to the extraordinary phenotypic diversity and plasticity of this important crop, have not been elucidated. Whole-genome, array-based, comparative genomic hybridization (CGH) revealed a level of structural diversity between the inbred lines B73 and Mo17 that is unprecedented among higher eukaryotes. A detailed analysis of altered segments of DNA conservatively estimates that there are several hundred CNV sequences among the two genotypes, as well as several thousand PAV sequences that are present in B73 but not Mo17. Haplotype-specific PAVs contain hundreds of single-copy, expressed genes that may contribute to heterosis and to the extraordinary phenotypic diversity of this important crop

    Utility of Different Gene Enrichment Approaches Toward Identifying and Sequencing the Maize Gene Space

    No full text
    Maize (Zea mays) possesses a large, highly repetitive genome, and subsequently a number of reduced-representation sequencing approaches have been used to try and enrich for gene space while eluding difficulties associated with repetitive DNA. This article documents the ability of publicly available maize expressed sequence tag and Genome Survey Sequences (GSSs; many of which were isolated through the use of reduced representation techniques) to recognize and provide coverage of 78 maize full-length cDNAs (FLCs). All 78 FLCs in the dataset were identified by at least three GSSs, indicating that the majority of maize genes have been identified by at least one currently available GSS. Both methyl-filtration and high-Cot enrichment methods provided a 7- to 8-fold increase in gene discovery rates as compared to random sequencing. The available maize GSSs aligned to 75% of the FLC nucleotides used to perform searches, while the expressed sequence tag sequences aligned to 73% of the nucleotides. Our data suggest that at least approximately 95% of maize genes have been tagged by at least one GSS. While the GSSs are very effective for gene identification, relatively few (18%) of the FLCs are completely represented by GSSs. Analysis of the overlap of coverage and bias due to position within a gene suggest that RescueMu, methyl-filtration, and high-Cot methods are at least partially nonredundant
    • …
    corecore